Intelligibility predictions for speech against fluctuating masker
Abstract
The effect of masking by fluctuating sources on speech intelligibility is difficult to predict. Intelligibility scores vary with the efficiency of the energetic masking, while the linguistic content of the message and the listener's cognitive performance add to the overall uncertainty, which peaks when the masker is itself speech. The present contribution proposes a signal-based assessment of the energetic masking at the sentence level. A mapping onto the scale of the speech intelligibility index is established for stationary noise. Predictions are quantitatively compared with the results of an intelligibility test for speech-modulated noise. The model is independent of voice similarities and semantic features, two important sources of informational masking.
Similar resources
Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm
State-of-the-art binaural objective intelligibility measures (OIMs) require individual source signals for making intelligibility predictions, limiting their usability in real-time online operations. This limitation may be addressed by a blind source separation (BSS) process, which is able to extract the underlying sources from a mixture. In this study, a speech source is presented with either a...
Information-preserving temporal reallocation of speech in the presence of fluctuating maskers
How can speech be retimed so as to maximise its intelligibility in the face of competing speech? We present a general strategy which modifies local speech rate to minimise overlap with a known fluctuating masker. Continuous time-scale factors are derived in an optimisation procedure which seeks to minimise overall energetic masking of the speech by the masker while additionally unmasking those ...
Effects of linear and nonlinear speech rate changes on speech intelligibility in stationary and fluctuating maskers.
Algorithmic modifications to the durational structure of speech designed to avoid intervals of intense masking lead to increases in intelligibility, but the basis for such gains is not clear. The current study addressed the possibility that the reduced information load produced by speech rate slowing might explain some or all of the benefits of durational modifications. The study also investiga...
Evaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms
A distortion-weighted glimpse proportion metric (BiDWGP) for predicting binaural speech intelligibility was evaluated in simulated anechoic and reverberant conditions, with and without a noise masker. The predictive performance of BiDWGP was compared to four reference binaural intelligibility metrics, which were extended from the Speech Intelligibility Index (SII) and the Speech Transmission I...
Overlap behaviour in task-oriented dialogue
Speakers change the way they speak depending on the surrounding environment. When masking noise obstructs the communication channel between interlocutors, they consistently engage in Lombard speech, whose spectral characteristics are well described (e.g. [1, 2, 3]) and are believed to result in better intelligibility through energetic masking reduction (e.g. [4]). However, less is known about h...